Outliers and influential observations can have an important effect on work with estimation and inference from establishment survey data. Practical development and implementation of methods to identify and account for outliers and influential observations in complex survey data
نویسنده
چکیده
Outliers and influential observations can have an important effect on work with estimation and inference from establishment survey data. Practical development and implementation of methods to identify and account for outliers and influential observations in complex survey data require an agency to balance several factors, including: (i) the mathematical statistics properties of detection methods and prospective estimators; (ii) a range of objective functions that include traditional measures like variance and mean squared error, as well as other functions tuned to reduction of risks associated with very rare extreme observations and estimates; (iii) information available on the underlying populations of interest; (iv) cost structures; and (v) important constraints on production systems and modification thereof. This in turn has three practical implications for research on outlier methods for establishment surveys. First, the predominant outlier literature in mathematical statistics has focused primarily on area (i). Consequently, it is important to expand our mathematical structure and objective functions to account for factors (ii)-(v). Second, methods that may appear to be inefficient or inadmissible under evaluation criteria in the traditional literature may warrant serious consideration under the more complex structure defined by (i)-(v). Third, the structure defined by (i)-(v) presents an opportunity to enrich and deepen the current mathematical statistics literature on outliers. In keeping with previous requests from FESAC, this paper focuses primarily on research that is at various stages of development, rather than on finalized research results.
منابع مشابه
Detection of Outliers and Influential Observations in Linear Ridge Measurement Error Models with Stochastic Linear Restrictions
The aim of this paper is to propose some diagnostic methods in linear ridge measurement error models with stochastic linear restrictions using the corrected likelihood. Based on the bias-corrected estimation of model parameters, diagnostic measures are developed to identify outlying and influential observations. In addition, we derive the corrected score test statistic for outliers detection ba...
متن کاملIdentification of outliers types in multivariate time series using genetic algorithm
Multivariate time series data, often, modeled using vector autoregressive moving average (VARMA) model. But presence of outliers can violates the stationary assumption and may lead to wrong modeling, biased estimation of parameters and inaccurate prediction. Thus, detection of these points and how to deal properly with them, especially in relation to modeling and parameter estimation of VARMA m...
متن کاملDiagnostic Measures in Ridge Regression Model with AR(1) Errors under the Stochastic Linear Restrictions
Outliers and influential observations have important effects on the regression analysis. The goal of this paper is to extend the mean-shift model for detecting outliers in case of ridge regression model in the presence of stochastic linear restrictions when the error terms follow by an autoregressive AR(1) process. Furthermore, extensions of measures for diagnosing influential observations are ...
متن کاملروشهای تعیین دادههای پرت در مطالعات پزشکی
Background: An outlier is an observation that lies an abnormal distance from other values in a random sample from a population. Outliers sometimes deal with to abnormality in obtained results from collected data and information. known outlier data by researchers, physicians and other persons that work in medical fields and sciences is important and they must control data before getting result a...
متن کاملIntroduction Package CircOutlier For Detection of Outliers in Circular-Circular Regression
One of the most important problem in any statistical analysis is the existence of unexpected observations. Some observations are not a part of the study and are known as outliers. Studies have shown that the outliers affect to the performance of statistical standard methods in models and predictions. The point of this work is to provide a couple of statistical package in R software to identi...
متن کامل